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Abstract —This paper presents a framework to evaluate the 
probability that a decision error event occurs in wireless sensor 
networks, including sensing and communication errors. We 
consider a scenario where sensors need to identify whether a 
given event has occurred based on its periodic, noisy, observations 
of a given signal. Such information about the signal needs to be 
sent to a fusion center that decides about the actual state at that 
specific observation time. The communication links - single- or 
multi-hop - are modeled as binary symmetric channels, which 
may have different error probabilities. The decision at the fusion 
center is based on OR, AND, A-OUT-OF-iV and MAJORITY 
Boolean operations on the received signals associated to individ¬ 
ual sensor observations. We derive closed-form equations for the 
average decision error probability as a function of the system 
parameters (e.g. number of sensors and hops) and the input 
signal characterization. Our analyses show the best decision rule 
is closely related to the frequency that the observed events occur 
and the number of sensors. In our numerical example, we show 
that the AND rule outperforms MAJORITY if such an event is 
rare and there is only a handful number of sensors. Conversely, 
if there is a large number of sensors or more evenly distributed 
event occurrences, the MAJORITY is the best choice. We further 
show that, while the error probability using the MAJORITY rule 
asymptotically goes to 0 with increasing number of sensors, it is 
also more susceptible to higher channel error probabilities. 

Index Terms —Data fusion, distributed detection, wireless sen¬ 
sor networks 


I. Introduction 

In a recent report, the consulting group McKinsey claims 
that the Internet of Things (IoT) - “sensors and actuators 
connected by networks to computing systems” - have “a total 
potential economic impact of $3.9 trillion to $11.1 trillion a 
year by 2025” [1]. Although these numbers and methodology 
might be questionable, the fact that wireless sensor networks 
(WSNs), which build the core of IoT, are becoming widespread 
in almost any possible application area, ranging from energy 
systems to sleep monitoring [2], 

This wide variety of uses implies the nonexistence of a given 
optimal technology solution that fits all WSNs (e.g. [2], [3]); 
the requirements of control applications in industries (e.g. [4]) 
is far different from in household monitoring (e.g. [5]) and the 
WSN deployment should reflect these differences. 
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In this paper, we will focus on general applications with 
loose reliability requirements so that sensors may have limited 
computational capabilities. Our goal here is to understand how 
to build a more efficient WSN to detect a given event based on 
an input signal, keeping its design as simple as possible. We 
follow here the idea of distributed detection (e.g. [6], [7]) so 
that every sensor estimates the occurrence of such an event and 
then sends this information to a fusion center, which decides 
based on the locally processed data; in contrast, by employing 
a centralized approach, the sensors need to send all their raw 
observations to the fusion center that then makes a decision. 

In signal processing and information theory [8], the field 
of distributed detection and estimation, which has a relatively 
long history (e.g. [8]—[ 13]), builds an elegant theoretical 
framework for finding optimal (locally and globally) strategies. 
In a series of works dating back to the 80’s and 90’s, the core 
theoretical findings of distributed detection were established, 
as summarized in [11]. Therein, the authors reviewed the 
advances on the topic, pointing out three different network 
topologies: parallel (sensors are not connected to one another), 
serial (sensors connected in series) and tree (sensors connected 
following a tree hierarchy). Different formulations for the 
detection problem have been then described and their optimal 
solutions discussed. 

For instance, the Neyman-Pearson formulation poses the 
problem as follows [11]: “for a prescribed bound on the global 
probability of false alarm, find (optimum) local and global 
decision rules that minimize the global probability of miss.” 
Other way of formulating the problem is based on Bayesian 
statistics aiming at minimizing the Bayes risk. Although 
they differ in basic aspects, both search ways of optimizing 
the detection rules based on binary hypothesis tests on the 
presence of a given signal and they state the likelihood-ratio 
test as the optimal rule (with different parameters, though). For 
these cases, the analyses are usually carried out in terms of 
false positives (also known as type I error, or false-alarm) and 
false negatives (also known as type II errors, or misdetection). 

Even though these results have been established for decades, 
there is still a great interest in distributed detection. Recently, 
Zhang et al. studied in two subsequent papers [14], [15] the 
detection error probability in balanced binary relay trees. The 
leaves of the trees (lowest level) are related to the sensors 
while the root (highest level) is the fusion center that makes 
the decision. In between them, relay nodes combine the binary 
messages sent by their two neighbors at the lower level (either 
sensors if we consider the second lowest level or other relays 
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for the other levels). Using the likelihood test at every level, 
they derive tight bounds of the error probability as a function 
of the number of sensors considering the binary symmetric 
channel [14] and erasure channel [15]. In [16], the authors 
assess the probability of decision error of a network with noisy 
binary symmetric channels between sensors and fusion center 
and points out that the fusion rule should be optimized with 
respect to the observed phenomenon. 

In [17], the authors combine the idea of robust decentralized 
detection with social learning. Among its contributions, [17] 
generalizes to tree and tandem topologies the results of [18], 
where a solution to the “minimax” robust detection problem 
(whose objective is to minimize the worst case performance 
when the probability distribution of the observations is not 
completely specified) for parallel topology is presented. In 
a recent paper [19], the authors combine the advances in 
spatial models for wireless networks using spatial point pro¬ 
cess theory (e.g. [20] and references therein) and distributed 
detection. In [21], decentralized detection for clustered sensor 
networks with hierarchical multi-level fusion is investigated. 
Authors conclude that the probability of decision error is 
dominated by the number of decision levels rather than the 
clustering formation, which renders minimum performance 
degradation with uniform clustering. In addition to wireless 
sensor networks, distributed detection has been applied in the 
analysis of spectrum sensing strategies for cognitive radio 
networks as in [22], 

The field of distributed estimation, although similar, has 
received relatively less attention. We can cite the following 
papers as initial works on that problem [9], [10], [23]. More re¬ 
cently in [24], [25], the authors studied the case of distributed 
estimation with constrained bandwidth of 1 bit, proposing a 
class of maximum likelihood estimators. Other relatively new 
results can be found in [26], [27], while [13] provides an 
interesting survey on the topic. 

In this paper, we choose a slightly different way by ana¬ 
lyzing a scenario where the quantization and decision rules 
are given, instead of seeking for optimal detection/estimation 
schemes [7], As previously mentioned, our aim is to assess the 
average error probability of a WSN with little computational 
capabilities. Specifically, our scenario is a set of sensors 
that periodically measure a given signal to detect whether 
a given event happens (e.g. if the signal has a value above 
a certain threshold). Based on their noisy measurements, the 
occurrence of such events is mapped into a binary number (e.g. 
occurrence implies “1”, not occurrence implies “0”), which 
defines the sensor state. The sensors need then to send their 
states to the fusion center via wireless channel, through one or 
multiple hops. The relay nodes only forward the information 
they received. We assume binary symmetric channels whose 
associated error probabilities might be different at each level 
of the multi-hop transmission (but, within the same level, the 
probabilities are the same). A decision about the state of the 
signal is done at the fusion center based on the binary signals 
related to each sensor. The decision rules employed by the 
fusion center are the memoryless Boolean functions OR, AND, 
K-OUT-OF-N and MAJORITY. 

Our study targets at answering the following: Under which 


conditions a low average error probability can be achieved for 
the scenario described above? For example, the answer for this 
question for the case of rare events is that a combination of 
AND decision rule by the fusion center and quantization of the 
event occurrence with “1” leads to an average error probability 
close to the event frequency itself, even when only a handful 
of sensors are used. A deeper discussion about this is found 
later in this paper. It also important to mention that our work 
differs from others in the literature as [16], [21] by focusing on 
a single error probability metric rather than cross-over, false¬ 
positive or false-negative probabilities. More specificaly, the 
present work generalizes [16] to multi-hop binary symmetric 
channels and general fusion rules. 

In summary, we identify the following main contributions: 
(i) propose an analytic framework that first breaks the WSN 
into three phases - sensing, communication and decision - to 
analyze the error events related to each one of them (Section 
II) to then rebuild the system as a whole to understand how the 
error propagates through those phases (Section III), regardless 
of the input function; (ii) find a joint error probability which 
accounts for sensing and communication, while considering 
that the sensor observations (which are conditionally indepen¬ 
dent) are subject to Gaussian noise and independent binary 
symmetric channels with different error probabilities at each 
hop (Section II); (Hi) derive, in closed-form, the average error 
probability for the OR, AND, K-OUT-OF-N and MAJORITY 
(Section III), showing that MAJORITY rule can asymptoti¬ 
cally reach a 0-error probability with the number of sensors; 

( iv ) show that the performance of OR and AND rules depends 
on the frequency of the event under analysis, while the other 
two do not - this fact implies that, when a limited number 
of sensors is considered, OR or AND rule (depending on 
the quantization mapping) can outperform the MAJORITY; 

(v) exemplifying the analysis with numerical results (Section 
IV) to illustrate our findings, providing the basis for our final 
discussions and possible extensions (Section V). 

II. System description 

Let us assume a network composed by a set Af = {1,..., N} 
of sensors that monitor a continuous signal x(t), where t e 
]R + and x : R + —» R, to estimate whether a given event £ 
related to x(t') happens and send this information to a fusion 
center. To avoid confusion, we summarize the key notations 
in Table I. Assuming that the sensors make synchronous and 
periodic measurements in predetermined instants t n = nr with 
n being a natural number and r e R + , we can define a function 
0[?r] with 0 : TT —» {0,1} that indicates if £ occurs at time t n . 
Hereafter we refer to 0\n\ as the system state at time t n . 

The sensors’ estimation of 6\i£\ is, however, imperfect. For 
each sensor i e Af, we define a function y, [n] with : N —> 
{0,1} that represents the estimation about £ from its individual 
noisy version of x(t). If a sensing error at sensor i happens 
at t n , then yi[n\ A 0[n]; otherwise [n] = 0\n\. 

After the sensing phase, the sensors need then to forward 
yi[n\ to a remotely located fusion center, which will process 
the received information to determine whether £ has indeed 
occurred. Each sensor i e Af sends its state yi[n] through 
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independent communication channels that are also subject to 
errors. Let Sj[n] with s* : N —> {0,1} be the state related 
to sensor i that is received by the fusion center after passing 
through the radio links, which can be composed by only one 
hop or multiple hops where relay nodes forward the received 
information. If an error occurs in the link related to sensor i’s 
?rth measurement, then s t [n] / y, [n]; if not, s.;[n] = yi[n]. 
From the signals ,Sj [n], the fusion center needs to decide 
whether event £ happened at t n . Let </(si[n],..., s,/v[n]), with 
g : {0,1}^ —» {0,1}, denote the Boolean function that esti¬ 
mates the state 9[n] by the fusion center so that the estimated 
state 0\n\ related to t n is given by 0[?r] = g(si[n],..., Sjv[n]). 
A decision error occurs whenever 9\n ] A 9[n]. The average 
error probability P e of the whole process is given by: 


v-1 


P e = Y! Pr 9[n\ A 9[n] 

71 


? ? n=0 


( 1 ) 


where the average is taken over the samples ne { 0 , 1 ,..., 77 — 1 } 
related to a time window from 1 0 = 0 and t v _i = T. 

Our goal here is to analyze different design options for the 
sensor network and the fusion center’s decision function to 
improve the estimation reliability, evaluated by the probability 
that 9\n\ A 9\n], Fig. 1 illustrates the scenario under analysis. 
For instance, x(t) may represent the temperature of an indus¬ 
trial plant that requires temperatures below a given threshold 
x± to guarantee its safe operation. The event £ can be then 
associated with an emergency where x{t n ) > x Given the 
signal x(t ), the threshold x t h and the number of sensors TV, 
we need to find the most suitable design for the quantization 
function 9 (i.e. define if the event x(t) > x t h is associated 
with 9 = 0 or 9 = 1) and the decision rule g (OR, AND, 
A'-OUT-OF-iV, MAJORITY) at the fusion center. 

Next we will focus our attention on the errors in the sensing 
procedure and in the communication links, which are identified 
by the dashed rectangle in Fig. 1, and how they affect the 
decisions done by the fusion center. 


TABLE I 

Notation summary 


Notation 

Description 

AT 

set of sensors 

N 

number of sensors in set J\f 

M 

number of communication hops 

7 

total number of samples 

x(t) 

continuous signal as a function of time t 

Xth 

signal threshold 

e[n] 

sensor state at time t n related to the nth measurement 

ln[n\ 

sensor i estimation about the nth measurement 

£ 

event related to x(t) 

Si|n| 

received state at the fusion center from sensor i 

9k) 

decision function 

e[n\ 

estimated stated at the fusion center 

nfitn) 

additive noise associated with sensor i at t n 

U 

noise mean 

a 2 - 

noise variance 

S 

Boolean variable associated with event S 

fs 

frequency of a given state Ss 

BiJ 

state of sensor i in hop j 

Pj 

error probability in the communication hop j 

Cm 

set of all possible error events for M-hop link 



0[n] = s(si[n], 


Fig. 1. Illustrative figure of the scenario under analysis. Sensors monitor a 
given signal x(t) in order to determine the binary state 0[n] at time t n • Each 
one of the N sensors in the network needs to send its state to a fusion center 
(control unit) that remotely decides the state 6[n\. In its way to the fusion 
center, errors may happen either in sensing ( yi[n] # 0[n] with i = 1,.., N ) 
or in communicating (s^fn] yi[n]). The dashed rectangle identifies where 
the error events may happen. Our goal is to find the expected error probability 
P e given by (1) and then compare different design options. 

A. Sensing error 

Let us denote Xj(i„) the version of x(f) observed by sensor 
i e J\f. The value of Xi(t n ) will be then used to define yi[n\. 
Then, we can define the probability P(xj(f„) < x t h) that the 
event Xi(t n ) ^ a;*, and its complement P(xi(t n ) > x t h). 

Remark 1: The sensing error probability is dependent on 
the input signal x(t) such that an error in the sensing procedure 
yi[n\ A 0 [?r] occurs in two situations: (a) x(t n ) > x t h and 
Xi(t n ) < x t h, or (b) x(t n ) < x t h and Xj(f„) > x*. The error 
probability is then related to the frequency that x(t, n ) is above 
or below the threshold x t h, which is captured by how many 
times 9[n] = 0 or 9[n] = 1 for n = 0, — 1. 

Let us consider x(t n ) < x* is associated to the state 9[n] = 
S where S e {0,1} and x(t n ) > x t h is associated to the state 
9[n] = S, where S denotes the complement of S. Then, we 
have the following definition. 

Definition 1: Recalling that // denotes the total number of 
samples within an arbitrary interval defined by to = 0 and 
t v -1 = T. We define = y x( t n )^x , h and y s 9x(t n )>X th 

as the number of samples related to states S and S in such 
interval, respectively. In this case, rjs + Vs = V- The frequency 
fs = f x (t n )^x , h that the state S appears between to and t v -i is 
fs = Vs/{Vs + Vs) Similarly, the frequency f s = / x (t n )>x th 
is fs = Vs/ (VS + VS)- 

B. Communication errors 

Let S'i j o[n] = yi[n\ be the signal sent by sensor i and 
Si : j[n ] be the state at the jth level with j = 1 
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Fig. 2. State probability tree for each sensor i = considering 

communication error probabilities pj with j = 1 where M is the 

number of hops between the sensors and the fusion center. The initial state 
is S if o = Vi\n\ and the final state is Si m- The state Si j E {0,1} and Si j 
denotes its complement. 


Pj 



Fig. 3. Transition diagram between the two possible states “0” and “1” 
when the Binary Symmetric Channel is assumed. A change only occurs with 
probability pj with j = 1,..., M, which is associated with the jth level error 
probability such that Si j = Sij—i with probability 1 — pj and Si j # 
Si j—i with probability pj. 

communication hop. At every hop, relay nodes forward their 
state Sij[n] to the next one. The probability tree of the state of 
sensor i is presented in Fig. 2, where the initial state S^ol/f] is 
equal to the state y t [n] after the sensing procedure and S i ;) [n] 
is the state at the jth level. 

We assume here a binary symmetric channel where the 
output is different from the input with probability p :) for the 
jth level of the tree. We assume that the error events are 
independent at each level and time-steps t n , which allows 
for dropping the index n. Then, the state S, : j = with 

probability 1 — pj and S, t j A Sij -1 with probability pj. Fig. 
3 represents the state transition diagram of this channel. 

Definition 2: A communication error event ,s,;[nj A j/i[n] 
related to sensor i and t n occurs after M hops if an odd 
number of errors Sjj ¥= Sij i had happened for j = 1,..., M. 

Definition 3: Let us define the set of all 

possible error events for an M-hop link as 

Cm = {{ }i {1} 5 {2}, {1,2}, {1,3},..., {1,2,.,,, M}}, which 

contains 2 M elements that refer to the index j = 1 
Let Cm, odd c Cm denote the subset of index with odd 
cardinality, which is composed by 2 A1 1 elements. The 
subset C^jj odd c Cm, odd with k = 1,,..,2 M_1 denotes each 
one of the k subsets of C M ,odd such that IJ C k M (M = C M ,odd- 
The complement C\ rJ odd is defined in relation to {1,2,..., M} 
such that C\ IodA + c\ I odd = {1, 2,..., M }. 

Let us now exemplify the construction of those sets when 
M = 3. Following the procedure presented in Defini¬ 
tion 3, C 3 = {{ },{!}, {2}, {3}, {1,2}, {1,3}, {2,3}, {1,2,3}}, 
which has eight (2 3 ) elements. The subset is then C 3 , 0 dd = 
{{1}, {2}, {3}, {1, 2,3}}, having then four (2 2 ) elements. Us¬ 
ing these sets, we have C} ndH = {1} and C\ nHd = {2,3}, 
Cf, odd = i 2 } and C “I,odd = {1,3}, C| )0dd = {3}' and C 3 odd = 


{1, 2}, and C 3 4 odd = {1, 2, 3} and C 4 odd = { }. 

Theorem 1: The communication error probability 
P(si[n ] ^ Vi\tt\) related to sensor i and t n over M 

hops is given by: 

P(si[n] / yi[n]) = J] ( Yl Pi J ( Yl 

4 1 \* £< -M,odd / odd J 

( 2 ) 

Proof: From Lemma 2 we know an error event [ | # 
yi[n\ occurs whenever an odd number of error events ,Sjj ^ 
Sij-i happens for j = 1 To compute the probability 

of such events, we need to use the law of total probability 
knowing that the error events at each level and in different 
measurement instants t n are independent. Using Definition 3 
to characterize the subsets containing the odd number of error 
events, we obtain (2). ■ 

Remark 2: If pj = p for all j = 1,..., M, then 

1 ff o n) M 

P(si[n] A Vl [n]) = - 1 2 P> , (3) 

which is known result for cascade of binary symmetric chan¬ 
nels [ 8 ]. 


C. Decision function 

Lemma 1: A decision error event 9[n] A 9[n] at fusion 
center depends on the actual state 9[n] and the decision 
function 9\n\ = g(si[?r], ..., Sjv[n]) as follows. 

. OR: If 0[n\ = 0, then at least one signal s, [n] = 1 with 
i £ Af leads to 9\n\ = 1 ¥= 9\n\. If 9\ri\ = 1, then an 
error event 9\n\ = 0 A 9[n\ only occurs when all signals 
s, ; [n] = 0. 

. AND: If 9[n] = 0, then an error event 0[?r] = 1 A 9\n\ 
only occurs when all signals s, [n] = 1. If 9\n\ = 1, 
then at least one signal s,; [n] = 0 with i e Af leads to 
9\n] = 0 ¥= 9\n\. 

• /i-OUT-OF-V: If 9[n] = 0, an error event 9[n] = 1 ¥= 
9\n\ occurs when at least K out of N signals Si[n] = 1. 
If 9[n] = 1, an error event 9[n] = 0 ¥= 9[n] occurs when 
at least K out of N signals s, [n] = 0. If N is an even 
number and K = N/ 2, the event where N /2 signals are 
s, [n] = 0 and the other N/2 are ,s t [n] = 1 may occur 
and the decision will be randomized such that 9[n\ = 0 
or 0[n] = 1 with 50% of chance. 

. MAJORITY: This rule is a specific case of A'-OUT-OF- 
N when K = [-/V/2], where [a] : R —» Z represents the 
ceiling function such that it maps the real number a to 
its smallest following integer. 

Corollary 1: The AND and OR decision functions are 
conditional versions of /\-OUT-OF-A ? rule as follows. 

. OR: If 9[n ] = 0, the error event is equivalent to 1-OUT- 
OF-.ZV rule. If 0\n\ = 1, it is equivalent to N-OCT-OF-N. 

• AND: If 9[n] = 0, the error event is equivalent to N- 
OUT-OF-IV rule. If 9[n\ = 1, it is equivalent to 1-OUT- 
OF -N. 
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III. Main results 

In this section we combine the results previously presented 
to derive the main results of this paper, namely the average 
error probability given in (1) for the OR, AND, A'-OUT-OF- 
N and MAJORITY decision rules as a function of the number 
of sensors N, number of hops M and other system variables. 

Lemma 2: The probability P e ,K,s[n] that K out of N 
sensors experience error events s^fn] / 9\n\ at x(t n ) for 
x(t n ) ^ :c t h (i.e. 9\n\ = S) is: 

Pe,K,s[n] = (4) 

where the probabilities P s g[n] = P(x(t n ) < x t h)P(s.i[n] ^ 
Vi[n]) + P(x(t n ) > a: th )T , (s i [n] = y^n}) and P s ,s[ n ] = 
1 - Ps,s L n ] • 

Similarly, the probability P e K g [ n | that K out of N sensors 
experience error events s.;[n] / 9[n ] at x(t n ) for x(t n ) > x t h 
(i.e. 9[n] = S ) is: 

Pe,K,si n ] = (-^§,S'[ n ])’ R: (-Ps,5[ n ]) JV K (5) 

where the probabilities Pg s [n] = P(x(f„) > Xth)P(sj[ti] / 
Vi[n]) + P(x(t n ) x* )P(si[n] = y^n ]) and Pg.sM = 
1 _ A,,S'L n ]- 

In other words, P e ,K,s[n] represents the probability that the 
signals related to K sensors are in state S and N — K sensors 
are in state S when arriving at the fusion center, given that 
9[n\ = S (i.e. x(t n ) x th ). 

Definition 4: Let Av(-, •) be the average operator such that 
Av(u[n], V) = 1 v \- n l’ (6 ) 

where v\n\ is a list of numbers, #(u[n]) represents its 
cardinality and V is the set containing the indexes related to 
each one of the #(u[n]) elements of v\n\. 

Theorem 2: If the state S = 0 (i.e. 9\n\ = 0 if x(t n ) ^ x*; 
refer to Definition 1) the expected decision error probability 
P e , or,o introduced in (1) using OR decision rule and N 
sensors is 


P e . or,o = /oAv(l - P e ,iv,oN,«5o) + /iAv(Pe,jv,i[n],5i), 

(7) 

where /o and /i are given in Definition 1, and So and Si 
denote the set containing the indexes related to S' = 0 and 
5=1. 

Similarly, the probability P e and.o using AND decision is: 


Pe, AND,0 = /oAv (P e) jV,o[ri])«So) + /lAv (1 — Pe,JV,l[ rl ])«Si) . 

( 8 ) 

For the A-OUT-OF-A' rule except when both N is even 
and K = N/2, the probability P e ,K ,o is: 

N 


Pe,K, 0 — Yi /oAv(P e)fe> o[n],S 0 ) + /iAv(P e>M [n],Si). 


k=K 


(9) 


For MAJORITY and N odd, the probability P e 


odd 

e, MAJ ,0 


is: 


p. 


odd 

e, MAJ ,0 


N 

^ /oAv(Pe,fc,o[n],5o)+/iAv(P e ,fe,i[n],Si). 


fc = [IV/2] 


(10) 


For N being even, the error probability P c 6 m aj o * s: 

A. e M n Aj >0 = n,°MAj )0 - (y Av (P e>N/ 2 fi [n],So) + 

+ T^Av (P e ,AT/2,l[u], <?i)^ . 


(ID 


If S = 1 such that 9[n ] = 1 is associated to the x(t n ) < 
x t h, then P e or,i = Pe, and,o, A, and.i = A, or,o, A.at.i = 
Pe,K,0 an d A, MAJ.l = A, MAJ.O- 

Proof: To compute the average error probability P e given 
in (1), we first need to compute the probability of error events 
for each decision rule (described in Lemma 1), knowing the 
value of 9[n], We then use the fact that 9[n] = 0 if x(t, n ) < x t h 
(i.e. S = 0), Lemma 2 and Corollary 1 to find probabilities 
for every measurement. To obtain P e , we compute the average 
error probabilities for 9[n ] = 0 or 9[n ] = 1 using Definition 
4 and their respective frequencies /o and f\ (Definition 1). 

By De Morgan’s law, which says that A+B = A ■ B for any 
Boolean variables A and P, we find that P e or.i = P e , and,o 
and P e and.i = A, or.o when considering S' = 1 is related 
to x(t n ) A x th . The A'-OUT-OF-A and MAJORITY rules, 
in turn, are actually independent of how S is assigned so 
Pe,K,S=0 = Pe,K,S=l an£ l A, MAJ,1 = A, MAJ.O- ■ 


Remark 3: These results can be written in terms of type I 
(false-positive) and type II (false-negative) errors. For S = 0, 
the type I error probability is given 1 : 

. OR: Av(l-Pe,jv,oM)«5o), 

. AND: Av(P e! jv )0 [n],So), 

N 

. Af-OUT-OF-iV: £ Av (P e , M [n],S 0 ). 

k = K 

Similarly, the type II error probability is given by: 

. OR: Av(P e ,jv,i[n],«S 1 ) ) 

. AND: Av(l-P e)JV ,i[n],Si), 

N 

. A'-OUT-OF-A^: 2 Av(A.fc, 1 [n],5 1 ). 

k=K 

Corollary 2: The asymptotic behavior of the error proba¬ 
bility P e with N for S = 0 and the different rules is: 


lim P e , or,o — Jo, (12) 

iV—► 00 

lim Pe, AND.O = /l, (13) 

AJ—>oo 

lim P e ,A-<rjv/ 2 l,o = 1> (14) 

lim P e ,K^\N/ 2 ],o = 0, (15) 

N—>oo 1,1 

lim P e . maj,o = 0. (16) 

N—xX) 


Remark 4: When N —> oo and S = 0, the type I error 
probability for OR rule tends to 1, while the type II to 0. 
Conversely, when AND rule is considered, the type I error 
probability for OR rule tends to 0, while the type II to 1. 
Therefore, OR always tends to decide 1, while AND 0. 

*The results for MAJORITY is a special case of A'-OUT-OF-IV. 
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Fig. 4. Numerical example of the proposed scenario for the signal x(t) = sin( 1 2irt/t]) + cos(20nt/ij) + siii(2fj7r/:/7/) + 3, which has a mean value of 
3 and i] = 300, .x,u = 4.5 (represented by the dashed line in the first and third plots, and leading to ?;o = 262 and //i = 38), N = 3 (three sensors) and 
M = 1 (one hop). Sensor i sets its state y;[n] at t n based on the noisy version of signal denoted by = x(t n ) + rii(t n ) where n;(t n ) is related to an 

additive Gaussian noise with mean /r = 0 and variance tr 2 = 1. The error probabilities associated with the sensing procedure are given in 17 and 18. After 
the sensing procedure, signal yi[n] is sent via a binary symmetric channel with error probability p\ = 0.1, yielding a new signal s;[n], At the fusion center, 
a decision is done based on Si[n] and the logic operations OR, AND and MAJORITY. Table II presents the error probabilities associated with this scenario. 


IV. Numerical Analysis 

To illustrate our framework in a specific setting, let us 
consider the noisy version of x{t) observed by sensor i such 
that Xi(t n ) = x(t n ) + rii(t n ) where rii{t n ) is related to an 
additive Gaussian noise with mean y and variance a 2 , defining 
yi[n\. Then, the probability P(xi(t n ) ^ at th ) that the event 
Xi{t n ) < occurs is: 

P(xi{t n ) a;*) = ^1 + erf ^ —. (17) 

Similarly, the probability P(xi(t n ) > a;*) is: 


> atth) 1 


1 

2 


^1 + erf 


24h 


x{t n ) 

crv/2 



We study in this section the effect of the number of sensors 
TV, the number of hops M and the channel error probability 
for an input signal x(t) = sin(127r t/rf) + cos(207rf/?7) + 
sin(267rf/r)) + 3 with 77 = 10 4 and different thresholds x±. We 
assume that the sensors’ observations are affected by additive 
Gaussian noise with mean /i = 0 and variance a 1 = 1 and all 
results have been obtained using 5 = 0 such that 9\n\ =0 if 
x(t rL ) < r.'ih• The results are only presented for OR, AND and 


MAJORITY rules - special variations of the A-OUT-OF-A', 
as stated in Theorem 2 and Corollary 1. It is worth mentioning 
that, although this setting is somehow arbitrary, the analytic 
framework proposed here can be applied to different scenarios; 
the computational experiments presented in this section are 
coded in python language using IPython framework and are 
available at [28]. 

Before starting, we would like to explain our choice of 
the input signal x(t). Our idea was to have a positive de¬ 
terministic signal, limited in amplitude, that has different 
peaks and a visually “interesting” behavior. This allows us 
to see non-linear effects of changes in x t h. Our choice x(t) = 
sin(12Trt/r])+cos(20TTt/r])+sm(26Trt/r]) + 3, whose amplitude 
varies from 0 to 6 is illustrated in the first plot of Fig. 4, but 
for r) = 300 and assuming M = 1 and pi = 0.1, TV = 3 with 
additive Gaussian noise (/x = 0, a 2 = 1). 

The top plot represents the signal x(t) while the actual 
system state 9[n] associated with the event x(t n ) > Xth 
is shown next. We present in the third plot the estimations 
Xi(t n ) from the three sensors based on their noisy version 
of x(t ), followed by their respective states yi[n\. The three 
received signals Si[n] at the fusion center, after passing 
through the communication link, are presented next. The last 
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Fig. 5. Average error probability P c given in Theorem 2 as a function of the 
number of sensors N for OR, AND and MAJORITY decision rules, threshold 
x t h = 4.5, M = 1 and p \ = 0.1. The total number of samples is 77 = 10 4 
where 770 = 8747 and rp = 1253. 



Fig. 6. Average error probability P c given in Theorem 2 as a function of the 
number of sensors N for OR, AND and MAJORITY decision rules, threshold 
at t h = 5.5, M = 1 and p\ = 0.1. The total number of samples is // — 10 4 
where 770 = 9790 and p \ = 210. 

plots represent the decisions using OR, AND and MAJORITY. 

Table II compares the simulated and analytic error proba¬ 
bilities for this example, which for better visualization only 
considers r) = 300. Notice that for this particular example 
AND rule has better performance compared with the other 
two. As we shall see later, AND rule leads to small error 
probabilities when the number of sensors is low. 

Fig. 5 illustrates how the average error probability P e using 
OR, AND and MAJORITY rules varies with the number N of 
sensors making measurements when the threshold that defines 
the event £ is a;* = 4.5. The plot shows that: (?) OR rule has 
a high error probability, which increases with N, ( 11 ) AND 
rule has the best performance for small values of N, and (Hi) 


TABLE II 

Error probability for the snapshot presented in Fig. 4 


Decision rule 

Analytic 

Simulation 

9 

Pe 

Pe (77 = 300) 

Pe (V = 10 b ) 

OR 

0.383 

0.366 

0.381 

AND 

0.101 

0.090 

0.101 

MAJORITY 

0.129 

0.136 

0.125 


Fig. 7. Average error probability P c given in Theorem 2 as a function of the 
number of sensors N for OR, AND and MAJORITY decision rules, threshold 
Xth = 3, M = 1 and p \ = 0.1. The total number of samples is 77 = 10 4 
where 770 = 4992 and 771 = 5008. 

MAJORITY is the best choice when N grows such that P e 
tends to 0. To analyze these facts, we need to deal with 9[n]. 
From Theorem 2 and Corollary 2, the frequency of 0 [ n \ = 0 
and 9 [n] = 1 determines the performance of OR and AND so 
the former outperforms when 9[n] =0 is more frequent, and 
vice-versa. In our example, the frequencies are: / 0 = 0.8747 
and fi = 0.1253, providing their asymptotic limit. 

For MAJORITY, the asymptotic performance is independent 
of such frequencies. When a small number of sensors is 
considered, however, it does not provide the best performance 
since an error in more than N /2 signal is not rare. In this case, 
AND is the best, even working below its asymptotic limit. This 
happens due to the way that the logic operation AND works, 
balancing the error events when 9\n\ = 0 and 6\n\ = 1. 

To get more insights on the system performance. Figs. 6, 7 
and 8 show the average error probability as a function N for 
x lh = 5.5 (f 0 = 0.979 and h = 0-021), x th = 3 (/ 0 = 0.4992 
and /i = 0.5008) and x th = 1.5 (/ 0 = 0.1128 and 
/i = 0.8872), respectively. In the scenario where x t h = 5.5, a 
similar behavior to the x± = 4.5 is observed, but with the 
AND rule having a better performance due to the smaller 
frequency f 1 of events 0\n\ = 1. On the other hand, when 
x* = 1-5, the performance of the OR and AND rules switches 
in relation to when x* = 4.5 as far as the frequencies /o 
and /i have also switched; now OR works better because 
9\n\ = 0 is much more frequent. When the 9[n] is more 
evenly distributed, illustrated in the scenario where x* = 3, 
AND and OR are equivalent and their error probability tends 
to 0.5 (which is basically a random guess of the input state). 
In all scenarios, the MAJORITY rule maintains its asymptotic 
optimal performance, working better and better when the 
number of sensors grows. 

Fig. 9 presents the effects of the number of hops on the 
average error probability for x t h = 4.5 and N = 3, considering 
that every one of the M hops is modeled as a binary symmetric 
channel with the same error probability p = 0.1 so that the 
equivalent channel has the error probability given by Corollary 
2. As one would expect, the increase of the number of hops 
M also increases the average error probability, regardless of 


















































Fig. 8 . Average error probability P c given in Theorem 2 as a function of the 
number of sensors N for OR, AND and MAJORITY decision rules, threshold 
x t h = 1.5, M = 1 and p \ = 0.1. The total number of samples is 77 = 10 4 
where 770 = 1128 and r/i = 8872. 



Fig. 9. Average error probability P c given in Theorem 2 as a function of the 
number of hops M considering pj — p = 0.1 for OR, AND and MAJORITY 
rules, threshold = 4.5 and N = 3. The equivalent channel probability 
after M hops is given in Corollary 2. 

the decision rule. For the setting considered here, the AND 
rule seems more robust against the increase of M, so that the 
error probability associated to it grows slower than the other 
two options. Once again, this fact occurs due to the way AND 
balances the error probabilities for the two possible values of 
6[n], given more weight (in proportion to its occurrence) to 
less frequent error events. The OR rule, on the other hand, 
has an overall poor performance because it balances the error 
probability in the opposite way, which leads to even worse 
error probabilities. The MAJORITY rule appears to be more 
susceptible to the increase of M than AND, which indicates 
that the increase of the equivalent channel error probability 
when M grows seems to affect more the former. 

To better understand this fact, we present in Fig. 10 how 
the average error probability varies with the channel error 
probability p\ when x t h = 4.5, M = 1 and N = 3. We 
now can see clearer that the MAJORITY rule is indeed more 
susceptible to worse channel conditions than the AND rule 
for the scenario under analysis. As presented in Theorem 2, 
MAJORITY, different from the latter rule, does not favor the 


— OR 

— AND 

— MAJ 



P 


Fig. 10. Average error probability P e given in Theorem 2 as a function of 
channel error probability pi considering only 1-hop transmission (i.e. M = 1) 
for OR, AND and MAJORITY rules, threshold x t h = 4.5 and N = 3. 

error events in different ways so that when the channel error 
probability increases, it will increase in the same proportion 
for both the more and less frequent states. Consequently, 
although MAJORITY asymptoticly outperforms the others in 
terms of the number of sensors, it is much more vulnerable to 
an increase of the channel error probabilities. 

V. Final remarks 

In this paper, we analyzed different ways that one could 
design a relatively simple WSN based on three phases, namely 
sensing, communication and decision. Different from the 
literature of distributed sensing and estimation, we targeted 
at implementing simple decision rules, regardless of their 
optimality. Our idea here was to show that it is possible to 
attain low error probabilities using a simple threshold based 
quantizer, a limited bandwidth of 1 bit and low complexity 
decision rules such as AND, OR and MAJORITY. 

If the occurrence of the event is rare and associated with 
the state “1”, the AND rule can lead to a low error probability 
with a small number of sensors, although MAJORITY can 
asymptotically reach 0-error probability for a large the number 
of sensors. If its occurrence is more equally distributed, then 
AND and OR rules have a poor performance while the 
MAJORITY is better than the other options but still requiring 
a relatively large number of sensors. We also show that the 
MAJORITY rule is more susceptible to channel errors than the 
AND rule, reflecting the way that it balances the error events 
with the input state frequencies. 

In any case, our results indicate a simple and cheap way to 
implement a WSN when the application does not have strict 
requirements. Our plan is to extend these results by consider¬ 
ing erasure channels as in [14] and advanced relay strategies 
as in [29], [30], assuming the nodes follow a specific spatial 
distribution as in [19]. Sensors experiencing different input 
signals (e.g. smart-metering or sensors in different rooms) 
also constitute an interesting extension of the present work. 
Other promising direction is to individually analyze the types 
I and II error probabilities such keeping one of them as a fixed 
target (e.g. applications that require a very low false negative 
probability, while false positives are unconstrained). 
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